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MONITORING SIGNAL-TO-NOISE RATIO IN X-RAY DIFFRACTION DATA 
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The work was funded through a grant by the United States government under NIH grant 
P50GM06240703. The United States government has certain rights in this invention. 

BACKGROUND OF INVENTION 

Electromagnetic radiation is used in diffractometric methods to resolve the structure of 
crystalline materials having interatomic distances comparable to the wavelength of the incident 
radiation. For example, in X-ray crystallography techniques a crystal is mounted between an X- 
ray source and an X-ray detector and a narrow monochromatic source beam of X-rays having 
wavelengths around 1 A is directed onto the crystal. Atoms in various planes of the crystal 
diffract the sourcebeam, thereby, generating a plurality of discrete refracted X-ray beams, which 
are detected and characterized with respect to their spatial orientation and intensity distribution. 
The directions and intensities of the diffracted X-ray beams, are monitored both as a function of 
time and the rotational orientation of the crystal. Diffraction patterns comprising a series of 
individually detected diffracted X-ray beams, referredto as reflections, are collected and analyzed 
to provide information related to the structure of the irradiated crystal. Indeed, the observed 
diffraction pattern is uniquely determined by the structure of the irradiated crystal. Essential 
to the collection of useful X-ray diffraction data is the use of high quality crystalline samples 
characterized by a single phase having a well ordered crystalline structure. 

In recent years, X-ray crystallography has proven a powerful technique for determining 
the structures of a wide variety of complex materials including crystals of macromolecules, such 
as purified proteins, peptides, protein-protein complexes, carbohydrates, oligonucleotides and 
nucleic acid - protein complexes. Techniques for collecting X-ray diffraction data are described 
in "Principles of protein X-ray crystallography' by Jan Drenth, Springer-Verlag, 1 994 and "The 
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theintensitiesOofreflecUonsofthediffiactedX-rays and corresponding posh™ 
.ne primary source of olographic data used in the determination of crysta. struchrre. A 
Jtal can be characterized as a three-dimensional translation* sfruchrre of the crystal urn, 
Is in addition to its transanal symmetry, a crystalline stntcmre can be charactenzed by 
symmetries within its unit ceU. In ore case of a protein mo.ecme, mere „ 65 brown ways o 

use in interpreting diffiacdon data. One can measure the intensity 10*1) for each reflectron. 
Toresolveacrystallmestmemre.anelectmmcdensityd^ 

dimensional fimcdon of me coordinate system tied to me axes of me unit cell of ore crystal. X- 
ray diction data can be used «„ calculate a structure factor, which in itself is msufficrenUo 
calculate the electron density. Specifically, me phase of each reflection, commonly calculated 
in terms of phase angle, is required in addition to intensities and positions evtdent m me 
diffraction pattern. 

' ^Wdiffi^onpanemanal^ 

from the measured intensides and posiflons of X-ray beams scattered by an irradiate* crystal. 
Diffraction patterns corresponding «o crystals comprising small molecules (molecular mass < 

„ Specifically, high redundancy aUows phase information to be obtained dfrecfly by evaluaung 
Idonships between observed structure factors and me desired phases Drrec — » 

As directme.hod,,mesememodsdonotdependonmepresenceofheavya.omsora M msha™g 
30 other special optical properties. 
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For larger molecules (molecular mass > 500 Da), such as proteins, peptides and 
oligonucleotides, determining estimates of the phases of each reflection via direct methods is 
feasible. As a result of this limitation, new analytical methods have evolved over the last 

■a Mno Mtimatine the phase angles of reflections generated by 
several decades which provide means of estimating mcp & 

bating crystal, comprising large molecules. The most common methods of reso.vnrg fine 
crystal structures of these larger molecmar mass componnds are multiple isomorphous 
rep lacement(Mm ) memods,mo 1 ecnlarre pl acemen t (MR)memodsand anomalous scattenng 
(AS) techniques. Although these methods provide a means of solving me phase problem , the 
pto e estimates provided are often limited to an incomplete set of reflections. Therefore, 
sobse^uen.improvemem.refinem^ 

necessary* arrive at electron density distributions, which can be used to interpret asample s 



structure. 



30 



is coUec.edandcomparedtomemffracuonpatt^^ 

atom derivative. Specifically, heavy atoms, such as ions or complexes of Hg, Pt, An etc., are 
tocorporatedintoacrysta. inched 

derivative crystals mns, be isomorphic with the native crystal, such mat incorporation of the 
additional atoms does no, significantly disrupt the lattice structure of the crysla! sample. 
Differences in the diffraction patterns corresponding to native crystid and derivative crystal are 
used.ocalema.eestimate.ofmephasesofmeobservedrefiections.Therefore.thepe^^ 

caused by the introduction of heavy atoms into the derived structirre provides a means of 
estimating phase. The changes in diffraction patterns of native and derivative crystals must be 
dearly detectable and large enough to obtain good phase estimates. Therefore, the abrhty o 
arriveataccurateelecttondensirydisMbutionsusingisomorphousreplacementmethodstsm^ 

dependentoncoUeotmgX-mymffractiondaUhav^ 

allow native and derivative diffraction patterns to be quantitatively compared. 

In molecular replacement methods calculated phases of a reference protein are used as 



3 



WO 2004/013683 



PCT/US2003/021988 



15 



Specificany.tereferencep.teinisused.aphasingmodeUoarriveatas.ractureof.he^ 
protein This technique is especially usefttl if the reference and target are structurally related, 
such as homologous proteins. Molecular replacement methods require X-ray diffraction data 
exhibiting high signal-to-noise ratios to achieve accurate electron density distributions and 
structures. 

Anomalous scattered techniques take advantage of the capacity of heavy atoms, such as 
S P CI or metals, to absorbed X-ray radiation. SpecifioaUy, absorption of X-rays by a heavy 
atem followed by remission of light with an altered phase results in Bragg reflections related 
by inversion through tire origin, referred to as Friedel pairs, which are not equal in intensity. 
Measurementsofthedifferencesmm.ensiu^ 

of the phase of these reflections. Typically, the intensity differences between members of a 
Friedel pair are very small, often less than 5%. Therefore, the ability to arrive a. accurate 
structures using anomalous dispersion techniques is highly dependent on collecting X-ray 
dU^ondarahavtogasignal-to-noi^^ 

between members of Friedel pate to be accurately characterized. In addition, measurements of 
the intensities of members of a Friedel pair must be made under very similar experimental 
conditions to ensure that the observed difference in intensity arises from the phase change 
imparted upon X-ray absorption followed by re-emission rather than by changes in the crystal 
structure, physical environment surrounding the crystal or changes in the quality of the X-ray 
diffraction system. Finally, me accuracy of structures determined using anomalous disperse 
techniques can be greatly increased by collecting and analyzing diffractions patterns 
corresponding to a plurality of different incident light wave.engths using multiple wavelength 
anomalous diffraction (MAD) methods. 

25 Currentty.hightmoughputmethodsfo^^ 

as proteins, peptides and oligonucleotides, are greatly needed to provide structural information 
which is complementary to the growingbody of functional data related to the biologic*! activty 
these compounds. Indeed, high throughput methods of macromoleoule structure determmation 
would assistgreatiymftediscoveryanddevelopmentofsmallmoleculepharmaceuticalscapable 

30 ofintemctingwithindividualproteins.pmtemaggregates.carbohydra.es.nucleicacidsorom^ 
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m acr„molecules important in regulating norma! cell mnctionittg and disease pathways, in 
particmar, single-wavelength anomalous dif&action (SAD) has a great dea. of prom.se for hrgh 
tooughputstmcturede.erminationoflargermole.ules. Firs., this technique is more eeononuc 
andmoreefficien«manomerme.hods,sincemenecessityofpreparingselenium-^ 
or heavy-atom derivatives is eliminated. Second, me technique does not require expensrve 
mnablesynchrotronX-myradiationsources. Rather, single-wavelength anomalous scattenng 
is compatible with home X-ray sources, such as Cu-K„ sources, which are readily available and 
inexpensive. Further, these methods do not require time consuming multi-wavelength 
measurements. Single-wavelengm anomalous scattering, therefore, has thepotentialtoprov.de 
a significant tool in large-scale macromolecnlar structural analysis associated with structural 
genomics. 

Although X-ray dif&actometric techniques are capable of determining the crystal 
structures of many of compounds, the full potential of the application of this techniques to 
biomolecules is currently no. realized due to substantially limitations related to the signal-to- 
noise ratios of crystallographic data collected using conventional diffiactometric methods. 
Indeed, conventional techniques of collecting and analyzing crystallographic data are mefficrent 
and often lack the signal-to-noise ratio needed for the accurate determination of electron denstty 
distributions of large compounds. First, conventional X-ray diffinctomettic techniques for 
determining structures of large molecules lack reliable and quantitative methods of evaluatmg 
signaRo-noise ratios during and after data coUection. Therefore, it is common practtce to 
completely discard substantial amount of da B upon realization mat the signal-.cnoiserat.os are 
toolowtogeneratereliableelectmndemity^^^ 

.oanomalousscattering.echmqueswhereinmeanomalousdifferencesinin.ensi^am 

and thus high signal-to-noise ratio is essential. Second, conventional analysis methods of 

crys.allogmphicda.au.ito 
artifac.sm.hestmc.umana.ysis^^ 

intensify of members of a Friedel pair discards valuable information related to sources ofno.se 
in the experiment. Finally, convention* X-ray diflmc.ome.ric techniques lack a sens.t,ve and 
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effective means of monitoring and assessing important experimental parameters, such as the 
quality of an irradiated crystal sample or X-ray diffraction system, during data collection. 

It will be appreciated from the foregoing that there is currently a need in the art for 
improved methods of collecting, analyzing and mterpreting X-ray diffraction data which provide 
higher signal-to-noise ratios. In addition, methods of quantifying and evaluating signal-to-noise 
ratios of crystallographic data in real time during data collection are needed. Furthermore, 
methods of signal averaging and statistical methods of analyzing diffraction data are needed to 
improve the usefulness of the X-ray diffraction data generated by diffractometric methods. 

SUMMARY OF THE INVENTION 



Thepresentinventionrelat^ 
of materials by characterizing their electron density distributions. More particularly, the present 
15 invention relates to methods of collecting, processing and interpreting X-ray diffraction data, 
which allow real time evaluation of the signal-to-noise ratio in crystal diffraction experiments. 
The present methods relate to the derivation of statistical indices for monitoring and evaluating 
signal-to-noise ratios in diffraction experiments. In addition, the present invention provides 
memodsofassessingexperimentaluncertaintymX-raydiffractiondataandm 
20 derived from experimental data, which are more sensitive than conventional methods. The 
improved signal-to-noise ratios and error analysis provided by the present invention result in 
more reliable and reproducible electron density distributions, which areuseful for characterizing 
the structural and functional properties of crystal samples. The present methods are especially 
useful for determining electron density distributions and structures for crystals comprised of large 
25 molecules, includingbut not limited to proteins, peptides, protein - protein complexes; protein - 
lipid complexes; olignucleotides; carbohydrates; lipid - carbohydrate complexes and nucleic acid 
- protein complexes. 

In one aspect, the present invention provides methods of monitoring and statistically 
30 evaluating noise levels observed in X-ray diffraction data. In one embodiment, the intensities 
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of diffracted X-ray beams corresponding to symmetry related reflections are measured and 
compared to each other to provide a real time measurement of random and non-random noise. 
The methods of the present invention allow for characterization of an average noise level 
corresponding to an entire X-ray diffraction data set. Alternatively, the methods of the present 

5 inventionprovide^ 

during collection of X-ray diffraction data. Preferred methods of the present invention provide 
a means of evaluating noise levels corresponding to very short data collection time intervals. In 
some applications, the present invention provides a means of analyzing signal-to-noise levels 
corresponding to data collection time intervals less than 10 minutes and preferably less than 2 
10 minutes for some applications. Characterization of noise levels corresponding to short time 
intervals is beneficial because it allows trends in noise as a function of time to be quantitatively 
evaluated. Noise levels obtained by the methods ofthe present invention may be used to provide 
real time measurements of signal-to-noise ratios observed during an X-ray diffraction 
experiment. In addition, noise levels measured by the methods of the present invention may be 
1 5 used to correct diffraction signals, particularly anomalous diffraction signals, for the presence of 
random and non-random noise. Noise levels measured by the methods ofthe present invention 
also provide information valuable for assessing the quality of a given crystal sample and 
degradation rates of a crystal due to exposure to X-rays, changes in temperature or other 
environmental conditions. Further, noise levels measured by the methods of the present 
20 mventionalsoprovideinfonnationvaluableforassessmgmep^ 

system including methods of evaluating the stability of an X-ray detector, the uniformity of an 
X-ray source and/or the stability of a crustal alignment system. 

In another embodiment of the invention, the present invention includes methods of 

25 determining crystal st™^^^ 

noise. The goal of this aspect of the present invention is to provide methods of increasing 
signal-to-noise ratios such that more reliable and reproducible electron density distributions are 
determined. Methods ofthe present invention using X-ray diffraction data corrected for non- 
random and random noise levels are beneficial because more accurate electron density 

30 distributions maybe determined with less signal averaging and redundancy than in conventional 
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^dom and random nee. In an eX ™ ndingtoacentricrefl eeUonpair 

md an acentric refiecdon pa,. The exemp^ ^ ^ „ tatensities 

of the centric reflection pair to ob.au> a to. intensdy dtff 

of*, acentric reuecdonpair to obtain a second intensdy d~ b, J 

d^anddreseeondur^ « ^, ^ ^ fc 

into X-ray diffiacdon data. The metods * J ^ fct density difference 

md * e SeC ° nd "1"^ dJLuon for fhe orystaf. fn addidon, to present 

=rs ====== 

^nrtonoisedatainreaftimeastoditoc— measurements are beurgtaKen. 

htana.ternativee.hodiment.topresentnretodconrpHsesconec^ 
datacomp^tensides — -2~==== 

- — rrCf to = TJL reLion pairs and an avera g e second 
subdacdngto intenstU* ^^^^^desoftopl^ofacenMc 

' =Z=--^ - ~ « — ' 

rela.estotonosefevefmtoX-rayd.ffra „ data . ^ anomalous 

— to *r — :t ir:;r iii i s - — « - - 

scadering signs! cotre^d ^ nos e , ^ ^ for ^ 

intensity differences, whtch may be use. ^ ^ ^ 

crystal. The exemplary method may further mcfude P 

* .• . „,Hiffrmncebv dividing to average first intensity difference »y 
9 average of the first intensity difference oy omuu. B 
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— deviation of the intensities and calculating a second weighted average of tire second 

of the intensities. In addition, the exemplary methods may further compnsmg the step of 
bating a ratio of tire firs, weighted average of tine firs, intensity difference and the second 
weighted average of the second intensity difference. In an alternative embodiment, weigh, 
averages of first and second intensity differences may be calculated using the square of tire 
standard deviation of the measured intensities. 

Inanomeraspect.mepresen.taventionpmvid^ 
noiseratioobservedinanX-raydiffiactionexperiment More particularly, the methods of tire 
present invention provide a statistical index for evaluating average signal-.o-no.se raho for a 
sdecteddamcoUectionpenod. Alternatively^^ 

provide a means of evaluating trends in me signaRo-noise ratio in real time. In a* exem P .ary 
embodiment intensities of symmetry related reflection pairs are measured and stausncaUy 
^yzedtopmvideameasurementofsignal-.o-noiseratio.Aprefen.dappticationo^ 

of presen. invention relates .o memods of evaluating me ratio of anomalous scattenng srgnals 
,omeasurednoiselevels.AnexemplarymemodofMsaspec.ofmepresen.mventionoompnses 

the steps of coUeoting X-ray dif&action data corresponding to the intensities of centnc and 
acentricreflectionpairs.ustogm.ensm^ 
„ noise levels, using intensities corresponding to acentric reflections .o determine anomalous 
scattenng signals and statistic*!* evaluating me determined noise leve!s and anomalous 
scattering signals to provide a measure of signal-to-noise ratio. 

Thepresen. methods of evafuating signal-.o-noise ratios during collection of diffraction 
, 5 da.aarebenef.cia.becausetheyaUowimportantexperimentalpamme.era.oheasaessedmreal 

toeduringda.acollection.Ftat.mep^^ 

time provide a means of determining how much signa! averaging is necessary .0 provide X-ray 
diffraction data capable of generating reliable and producible electron density distribute. 
The ability «o quantitative,, assess me amount of signal averaging and redundancy nectary to 
30 acKeveaccum.edecttondensitydis.ributi^ 
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of X-ray diffraction data collection methods and supports applications of high throughput 
structure determinations. In addition, the methods of the present invention are able to identify 
experimental conditions wherein further signal averaging and redundancy does not improve 
signal-to-noiseratiosoraomaUydecreasessignal-to-noiseratios. Second, the present methods 
of evaluating signal-to-noise ratios in rea! time provide a means of assessing the quality of an 
itradi ated crystal sample. In this context, the quality of an irradiated crystal sample refers to Ate 
uniformity of. the crystal structure, uniformity of me physical environment surronndrng the 
o^fceextentofctystallimtyan^ 

mosaics Observed trends in signal-to-noise ratio may be related to changes in crysta! quahty 

and, therefore, may provide a means of assessing when a change in crystal sample is necessary, 
inanexemplaryembodimenacrexamp.e.anewcrysmlsampleisdeemednecess^whenme 

observed signal-to-noise ratios fall below a specified threshold value, such as a value of stgnaU 
to-noise of 1.67. The value of this threshold may depend on a number of variable include the 
structure of the crystal, me anomalous scattering power of the crystal and the X-ray diffraction 
system employed. Third, the present methods of evaluating signal-to-noise ratios in real time 
provide a means of assessing other experimental parameters important to collecting X-ray 
„ ifir actiondatahavmgmghsignal-,o^^ 

noise ratio may be used to identify instabilities in the X-ray source, ctystai alignment system or 
X-ray detector. 

Tbepresentinventionalsoprovidestatistica^^ 
X-ray diffraction data sets, which improve observed sig naI-.o-noise ratios. In an exemplary 
method, diffraction patterns are collected in discrete diffraction data sets corresponds to 
intensity distributions and positions of reflections acquired over different time intervals and/or 
for different crystal samples. The signal-.o-noiso ratios of individual diffraction data sets are 
evaluated using the methods of the present invention. Next, two or more discrete dato sets are 
merged andmesignal-to-noiseratiosofthecombineddata sets are determined and compared to 
the signal-to-noise ratio of the discrete diffraction data sets. In this manner, the signal-to-no,se 
mtio is calculated for all possible combinations of the discrete diffraction data sets. 
Combinations of data sets are identified wherein tire merger of two or more data set, results m 
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a merged set may be provided when sources of noise in the data have opposing effects and, 
hence cancel each other out. This aspect ofthe present invention enables me maximum amount 
ofusemlmfonnadoninadiffercUondatasetorplumliryofdatase^tobeextmctedanduttl^ 

in determination of electron density distributions. This aspect of me present invention ,s 
applicable to any X-ray diffraction technique where high signal-to-noise ratio is desnable, 
particularlysmglewaveleng.hanomalousscaneringtechniques.multiplewavelengm anomalous 

scattering techniques, multiple isomorphous replacement memods and molecular replacement 

methods. 

The concept of separating an overall dam set into its data subsets and evaluating the 
statistics of these data sobs* individually in order to identify and separate noise and srgnal 
,evels as described and claimed .herein, opens up the possibility of monitoring the signal and 
noise levels in any types of data sets containing two or more data subsets in an overall 
experimental data set. For example, if within the overall data set there are subsets of data of a 
theoretically equal vahre, men the da,a within mis type of data set can be used to produce a true 
measurement of noise level in the data, since in the real life experiment the theoretical vahre is 
never acmeved due to ^perimentid and insWrmental errors ryped of data which are theoretically 
different can then be used to measure me signal (with noise) level. By comparing the overall 
magnitudes of these two types of data, one can then monitor the signal-to-noise ratio and 
determine the usefulness ofme data. If there is a signal in the data, bu, the signal-to-noise raho 
isnothigh enough, ane may want, o collect additional data to see if the ^-.o-noise mtio can 

statistically evaluate the quality of experimental data during collection is a srgmfican. 
improvementindetermmingifanexperimentiscompleteorifsuchanexperimentcanor cannot 

prnduceasuccessfulresultThepresentinventionprovidesaway to carryoutsuch an evaluation 
for most types of experimental data. Other types of experiments which can be benefitted by the 
memods of the present invention include, but are not limited to, the neutron and electron 
diffraction experiments from crystafline materials and any experiments fha, have dam wrth the 
above-described characteristics. 
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fcaddifionK^oos ^eringme^eo.^^of*.^^ 

.^method Thebenefitsoftbetacreasedrignal-to-noiserano 

orovided by the present invention, however, are grea 

Ifively LaU signals or changes in signals ,o determine crystal suture, soch as s^e 

ZIZ — — — muMple wavelens<h m 7 

re^X — isor.rorphousrep.ace.nenr— , and — rep—t nredrods 

^nes, a better comparison of diffraction data corresponding ,o native and denvafive 
crystals may beperformed using diffraction data having high signaUo-no.se ratros. 

^cLatrnganomaiousscanermg^^ 

L anlalons scattering signai-to-noise is below an anon,a,oos 
Orreshoid.mapreferredembodun^^^ 

^al-to-noiseratiootX-raydiffiactiondata. An exempt method cnmprisng the steps ofi 

r f ol first se, of intensities; (2) measuring a second se, of intensities correspond^ to 
ratio oairs and a plurality of acentric reflechon parrs and 

pl ural,ty of centnc reflection P-s an P orthesecondsetofinteositte s 

calculaungasecondanomalousscattenng srgnal to noise ratio tor 

signal-to-noise ratio. 

TnemventionisfurfireriUus^ 
) exemplary claims. 
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BRIEF DESCRIPTION OF DRAWINGS 
Figs. 1 A-1D are the graphs representing the 60°, 120' and 180° crystal orientations of 
Zn-free insulin data sets, respectively. 

Fig. 2 is a representation of the electron density around Leu 1 IB of Zn-free insulin. 

DETAIL DESCRIPTION OF THE INVENTION 
Hereinafter, the following definitions apply: 

W and -noise level" are used synonymously and refer ,o Are different in the intensities 
of a pair of eentrie reflections and is mathematically represented by the equation: 

noise= l\- wherein/* and/ are centric reflections. 

Noise may comprise random noise, non-random noise or a combination of random and non- 
^ndomnoise. Noise may originate from a wide variety of sources, h, one aspect of Ore present 

of a crysta, .ample, anisohtoes in me physical environment of a oystal sample, variatrons . 
Are intensity of the incident X-ray beam, variations in the sensitivity of a X-ray detector, and 
variations in me physical orientation of a crystal sam P .e. Noise may change significantly wth 
toe for example due to degradation of a crystal samp.e. Alternatively, noise may reman, 
substantially constant over a selected diffraction data collection interval. 

"Signal" refers to any detectab.e output which is distinguishable ftom noise. In certain 
tomtensitydismbutionofoneormorerene^ 

reflections or Bijvoe, differences of acentric reflections. In one embodiment of tire present 
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Jin, tire physical environment of a crystal sample can be isofroptc, anrsohoptc or have 
^ons which are isotiopic or atopic. In some app.ica.icns, tire phystcal envrronment 
snrrounding a crystal sample comprises frozen or partially frozen mother honor. 

••Centric reflections" refer to a pair of diffracted X-ray beams relate! by symmetry such as 
inversion throng me origtit or rotation, symmetry, mat are also symmetric mates. * *«* 
ambers of a centric reflection pair exhibit identical intensities. Therefore, centnc reflecons 



obey the Friedel relationship: 



1 Z ~ 1 - . wherein / ° and / °_ are the intensities of centric reflections. 

pairs of the present invention may include symmetry related reflection pairs that exhtbtt non- 
Lntica, intensifles due to the presence of noise. Centric reflections useflt, for pracUcmg tite 

by inversion through the orignt or rotationaUy re,ated reflections, such as 2, 3- 4- and 6- fo d 
s „ reflections, m me present invention, differences in the intensities of ^ a 



set. 



-Acentric reflections" refer to pair of diffracted beams related by inversion through ongm tira 
reflection pairs do not obey the Friedel relationship and, therefore, 
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j%j a ■ wherein/" and/" are the intensities of acentric reflections. 



The differences in intensities of members of one or more acentric reflection pairs provide 
nation re,a.=d to the phases of reflections obse^ed in an X-ray diffrachon pattern. 

.^onaious scattering signaV refers to a portion of the difference in intensity of a pah of 

t.- t. • * ft™ the absorotion and subsequent re-emission ot A-rays 
acentric reflections which arises from the absorption h ^ +prminine 

<-pu A ^ent invention provides methods of determining 
during the diffraction process. The present invennon p . ^ . - ^ 

• worrited for noise and methods of increasing the ratio of an 
anomalous scattering signals corrected tor noise <m 

anomalous scattering signal-to-noise. 

-X-ray diffiaction data" and "X-ray diffraction data set" are nsed synonymous,, and refers to 

"^'TVZ 

« may inch.de measnrements of me intensities, intensity disttibutions and posmons of 



to-noise ratios. 



"Friedel pair" is a pair of Bragg reflections related by inversion through the origin. When 
anomalous scattermg cannot be neglected, me result is me break down of Fri 
1) * I(-h, -k, -1) and the pairs are more correctly called Bijvoet pairs. 

^jvoetdi ff erence»referstomedi« 

pair. Bijvoet difference may be expressed by the expression: 

,whereinAIistheBijvoetdifference 5 and / + and/_ are the intensities of 



+ 



the Bijvoet pair. 
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tensity difference" refers «o the differed in the intensities of the members of a symmetry 
rela ted reflection pair. Intensity differences use*, in the methods of Are present .nvenflon 

Bijvoet differences. 

ft. precise nature of the invention. It wU. be apparent, however, to Arose of « * the art « 
tte invention can be practiced wiflron, these speeifie detai., Reference in the specficanon to 
» a pre ferred embodiment," "a more preferred embodiment" or "an exemplary embodnnent 
me L ma, a particular feature, structure, or chamcteristic se, forth or described in connection 
wifll me embodiment is included in a, .east one embodiment of the invention. Reference to 
"preferred embodiment," "a more preferred embodiment" or "ao exemplary embodtmen. m 
varions places in the specificahon do not necessarily refer to the same embodiment. 

Thepresentrnventionmlatestomethofcof^^ 
diffraction data, which aflow real time evalnation of me signal and noise data separately in an 
experimenta. data set. More specific^, the methods relate to the derivation of a starishca! 
index for monitoring me signal-to-noise ratio in me diffraction experiment, correcting 
anomalous scattering signais for noise and maximizing me signal-to-noise raho m an X-ray 
diffraction data set. 

ta an exemplary embodiment, the present invention provides methods for extracting a 
W eax anomalous scattering signal flom aX-ray diffraction data se, which call for asep^ 
accurate measurement and evaluation of the signs, and noise in the da* set. Although* 

methods, multiple isomorphous replacement methods and molecuiar replacement methods. 
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Anexemp j f ™Hc and acentric reflections. Intensities of 

involves statistically analyzing the intensities of eentnc and acentric 

centric and acentric reflections pairs may be measured by any means avatlable » the art In a 

pin. invention inc.ude me nse of defractotnetiy, wherem members of a 

ZL S e q „entia,ly (i.e. bacx-to-bacU, An advantage of deflactometiy mefcods - ** 

ZZLJ, of the intensities of members of a Priedel pair are made very Cose m time. 

^vrytnefltodsofthepresentinventio nindudemeuseofareade.Cors.wherema,^ 

a crystal sample. 

1* statistical average of the reflection intensity, A , is routinely used to evaluate , the 
strengmofmeanomalouss™ 

(D 

A - <lI (+ )-I(-)l /I> 

•^,oic t * T it follows from equation (I) that larger 
Since for the anomalous scattering signals I w * 1 «, « i°" OWi > 

I Lcorrespondtosnongeranomalonsscattenngsigma,, »— *~ 



set. 



An important feature of the methods described above is the extraction of a weak 

• ffor example 1% or less of the overall intensity of the signal) from 

anomalous scattering signal (for example, l /o or 

l^lLflte^isH^beneficMmmis^ Spec^e^valt* 
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i • ^ a /lata set However, for A to be a 
f a the stronger the anomalous scattering signal in the data set. n 

a. • ^t^eaualitvofthe diffraction data set 
Rsvm is another indicator routinely used to judge the quality 01 

with the^ower values of Rsym indicating better data quality. Rsym is defined by the expression. 

^=(X[|/-(/)l]/£'> 

wherein lower values of Rsym correspond to better data quality. 

^ indicated by equation U, Rsym tneasures only the axemen! between symmetry 
e.atedrTfllonsBluseofttds.Rsymagahtisnotareliableindexforiud^gfi.enotseleve, 

different crystals or on different runs of the same crystal. All these data may P 
signal and noise levels. 

.d containing heavy atoms conttibufing to anomamus 

d _ ot refiections is no, — ^ — ^llosymmetrica, Sections 
in the presence of anomalous dispersion. However, exceo tion 

ofgroupslandS. For such centtosymmetticalreflecuons ^-\.y Therefore,ut typ 
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^ / AT =1 . - L, = 0) and acentric (Ala - I< +) - V) * °) 
diffraction data set there will be centnc (AI C i<-) 

reflections. 

As crystaUine sutures and their physical environment usually contains stnrctural 

I sucL twinning, small molecmar rotations, disp— , mo.ecu ar stfuctur 
Z« "ending, inctusions, staging fan., and other short-range drsorders. In addrUon, 

_ . at — t 1*0 AI data contain both the anomalous 
presence of anomalous reflections AI, - V) " hi 

Mattering signal and noise. Equations HI and IV illustrate this concept mathematically. 

(my 

AI, = Signal + Noise ^ 
AI,. = Noise 

parameters to be obtained for acentric and centric reflections, respectively: 

(V) 

Aa=<| Al a |/a I > 

(VI) 

Ac = <l AI c |/ai> 

wherein <r, is the standi deviation of the nreasured intensities. As the noise teveis oHhe 
centric and acentric reflections a. expected to be Are same, Ac represents the measure d no, e 



7 .2 

Aa and Ac, (Aa - Ac), or, alternatively, ^ 



(Aa) " - (Ac) - provide an accurate and sensitive 
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ro _e„«of u.eanon.a.ous—g^alcceced for noise. 

define the following pa—, which are useful for evaluating the quahty of an X-ray 

diffraction data set: 

(Vila) 

As = Aa- Ac 
or 



A5=^|(Aa) 2 -(Ac)' 



(Vllb) 



Ac 



(DO 

As 

Pas=—[ 

< > 

AS shown in equations VTIa and VHb, As is the anomalous scatting signs, cotrected for noise. 

of the measured intensity differences for acentric and centric reflection pairs corresponding to 
anomalous scattering signs, plus noise and noise, respective*, which provides a very accurate 
and sensitive index for evaiuating signal-.o-.oise ratios corresponding to tire anomalous 
scatteringsignal.taaaexemplaryen^^ 

,o-noise ratio. Pas is an indication of Ore measured anomalous scattering power of me crystal. 
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The larger the value of Ras, the stronger the anomalous scattering signal, and Ras values 
less than one (Ras < 1) indicate no anomalous scattering signal. The Ras statistical index may 
be used to quantitatively evaluate anomalous scattering signal and noise levels in X-ray 
diffraction experiments and provides a tool for evaluating the quality of diffraction systems 
including the X-ray source, detector and crystal alignment system. Ras is useful in monitoring 
the quality of the data in terms of data collection strategy, instrument settings, choice of data 
processing program and other factors. The Ras index can also be used during data collection to 
evaluate whether enough data has been collected to provide the needed anomalous scattering 
signal strength to solve the structure or whether additional data need to be collected. 

In addition, Ras provides a statistical index which more accurately and more objectively 
evaluates the anomalous scattering signal and noise levels in X-ray diffraction data than Rsym, 
<Vo l >, Aa and other data evaluation schemes currently used by crystallographers. <U<s l > is not 
a signal-to-noise ratio in terms of anomalous scattering and is also subject to the procedure 
during integration. 

Macroscopic crystals, especially protein crystals, are mosaics of submicroscopic arrays. 
As a result of this mosaicity, X-ray beams diffracted from a crystal propagate along a plurality 
of axes forming a cone of X-ray trajectories extending in space away from the crystal sample. 
Therefore, each reflection recorded by an X-ray detector actually comprises an intensity 
distribution in two spatial dimensions. The functionality of such an intensity distribution 
depends on crystal structure and may often be represented in terms of one or more a Gaussian 
functions, Lorentzian functions or combinations of these. The statistical analysis methods 
described above can be improved by taking into consideration the functional form of the intensity 
distribution of acentric and centric reflections. For example, for crystals exhibiting reflections 
having Gaussian intensity distribution, Aa and Ac are more accurately represented by the 
following expressions having a cr, term substituted into the denominator of equations V and VI: 

Aa = <| Ay/oy^ CX) 
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Ac = <| &k\te?> 



(XD 



wherein o \ is the square of the standard deviation of the measured intensities. Therefore, the 
present invention includes alternative statistical treatments of acentric and centric reflection data, 
which depends on the functional form the observed intensity profiles. 

The method described above works well under the assumption that the number of 
intensity measurements N(+) and N(-) are equal. In practice it happens that those numbers are 
not actually the same and whereN(+) andN(-) can differ significantly, therefore making I(+) and 
statistically uneven during the data reduction process, introducing noise to the Bijvoet 
difference, whichbecomessignificantatlowredundancy. Tocorrectsuchunevenmeasurements, 
the parameter corresponding to the measure of the evenness of the Bijvoet pair distribution is 
useful for characterizing anomalous scattering signal to nosie ratios: 

min(iV(+),iV(-)) ^ 
EaS - K max(iV(+) f tf(-)) 

Eas also provides an accurate and sensitive index for evaluating signal-to-noise ratios of an X-ray 
diffraction data set. 

In the absence of noise, rotational symmetry related reflections, such as 2-, 3-, 4-, and 6- 
fold symmetry reflection, are also predicted to have differences in intensity equal to zero. 
Therefore, differences of the intensities of these reflection pairs may also be used to directly 
measure noise levels in X-ray diffraction data. The use of rotational symmetry related reflections 
is especially beneficial in certain circumstances, such as when there is a lack of sufficient centric 
pairs in the data to allow for an accurate estimate of the noise levels using centric reflection data 
alone. However, in order to use this additional intensity data, the diffraction data set needs to be 
first processed in a Laue group with a lower degree of symmetry. The statistics are then taken 
before the data are merged into the correct Laue group. 
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The above-described method was implemented in 3DSCALE, which is a software 
package for experimental error correction using 3-dimensional error models with Free-R tests. 
In 3DSCALE, centric and acentric reflections are flagged but not differentiated during the 
experimental error corrections and scaling. The parameters proposed and described in the present 
5 method are evaluated as output statistics. In particular, during data collection the collected 
frames are integrated synchronously with the data collection error corrections. The scaling 
process is set up to be performed periodically to provide an on-the-fly evaluation and monitoring 
of the quality of data collected over a certain time period. 

1 0 The above-described method was tested using the diffraction data of the Zn-free insulin 

crystal (space group I2 t 3 , a = 78.95 A). The data were collected up to the 2. 1 5 A resolution using 
a Bruker Proteum-R CCD detector mounted on a Rigaku RUH3R rotating anode generator using 
focused (MSC/Blue confocal optics) CuKa radiation. A total of 900, 0.2° oscillation images 
were recorded using an exposure time of 1 minute. The intensities were integrated with Broker's 

15 new data analysis package PROTEUM. Data were scaled using modified 3DSCALE package 
in accordance with the method described above. Three data sets representing 60°, 120°, and 
180° of crystal rotation were generated by scaling 300, 600, and 900 images, respectively. For 
each data set, Aa, Ac, Ras, Pas and Eas were calculated using 10 different resolution shells. The 
calculation results are presented in Tables la-lc below. 

20 

Table 1(a). 60° data set 



Reso. 
Shell(A) 


Rsym 


<I/oV> 


Aa 


Ac 


Ras 


Pas 


Eas 


4.78 


0.0248 


33.77 


4.35 


2.33 


1.87 


.0.0598 


0.931660 


3.76 


0.0235 


33.70 


3.68 


2.16 


1.70 


0.0451 


0.895819 


3.28 


0.0239 


31.04 


3.22 


2.10 


1.53 


0.0361 


0.873172 


2.97 


0.0244 


28.48 


2.88 


2.01 


1.43 


0.0304 


0.857861 


2.75 


0.0252 


25.70 


2.64 


1.92 


1.38 


0.0280 


0.848586 


2.58 


0.0260 


23.54 


2.44 


1.83 


1.33 


0.0257 


0.840851 


2.45 


0.0267 


22.03 


2.28 


1.63 


1.41 


0.0299 


0.855393 
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2.34 


0.0271 


21.04 


2.16 


1.62 


1.33 


0.0255 


0.862493 




0.0275 


20.37 


2.06 


1.64 


1.26 


0.0207 


0.865128 


/.ID 


0.0276 


19.97 


1.99 


1.63 


1.22 


0.0182 


0.870441 




Table 1(b). 120° data se 


t 






Reso. 
Shell(A) 


Rsym 


<I/<5,> 


Aa 


Ac 


Kas 


r as 


Eas 


4.78 


0.0285 


30.21 


5.44 


2.07 


2.63 


0.1115 


0.972757 


3.77 


0.0279 


30.06 


4.45 


2.05 


2.16 


0.0796 


0.950701 


3.28 


0.0282 


27.84 


3.89 


1.96 


1.98 


0.0691 


0.940222 


2.97 


0.0289 


25.61 


3.45 


1.85 


1.87 


0.0627 


0.931736 


2.75 


0.0297 


23.25 


3.13 


1.76 


1.77 


0.0586 


0.927408 


2.58 


0.0306 


21.36 


2.86 


1.70 


1.69 


0.0546 


0.922546 


2.45 


0.0314 


19.95 


2.68 


1.62 


1.65 


0.0529 


0.923495 


2.34 


0.0319 


19.07 


2.50 


1.67 


1.49 


0.0431 


0.915831 


2.25 


0.0322 


18.44 


2.35 


1.66 


1.42 


0.0374 


0.909613 


2.15 


0.0324 


18.11 


2.23 


1.53 


1.46 


0.0387 


0.909807 



Table 1(c). 180° data set 



Reso. 
Shell(A) 


Rsym 




Aa 


Ac 


Ras 


Pas 


Eas 


4.78 


0.0295 


25.44 


5.70 


1.66 


3.43 


0.1586 


0.915457 


3.76 


0.0293 


25.38 


4.51 


1.69 


2.67 


0.1111 


0.892230 


3.28 


0.0300 


23.77 


3.89 


1.68 


2.32 


0.0933 


0.877214 


2.97 


0.0307 


22.16 


3.54 


1.64 


2.16 


0.0858 


0.866732 


2.75 


0.0316 


20.37 


3.26 


1.62 


2.01 


0.0805 


0.859751 


2.58 


0.0325 


18.92 


3.03 


1.60 


1.90 


0.0759 


0.853574 


2.45 


0.0333 


17.81 


2.89 


1.58 


1.83 


0.0733 


0.850606 


2.34 


0.0338 


17.11 


2.71 


1.72 


1.58 


0.0579 


0.838507 
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2.24 


0.0342 


16.59 


2.56 


1.76 


1.45 


0.0480 


0.828263 


2.15 


0.0344 


16.32 


2.41 


1.88 


1.28 


0.0323 


0.821201 



Turning now to Figure 1, the triangles, circles and stars on the data lines correspond to 
5 the 60°, 120°, and 1 80° Zn-free insulin data sets, respectively. Figure 1 A illustrates the values 
of Ras versus resolution. It can be seen that all three data sets have the signal/noise ratio Ras> 
1.0. Ras improves with increasing redundancy from 60° data set to 180° data set. Figure IB 
illustrates the values of Pas versus resolution. Figure 1C illustrates the values of Aa and Ac 
versus resolution. Ac indicates the noise level in the data. Figure ID illustrates the values of Eas 
1 0 versus resolution, indicating that the Bij voet pairs and 1^ are not evenly measured in the data. 
Eas of the 180° data set drops at high resolutions shells counts for the decrease of the data quality 
compared with the 120° data set. 

The Ras values provided in Table 1 and shown in Figure 1 A illustrate that all three data 
15 sets contain anomalous scattering signals that are clearly above the noise level. However, 
different resolution cutoffs must be used for successful phasing: 3. OA for the 60° data set and 
2.2A for the 120° and 180° data sets, assuming a desired Ras value above 1.5. The Pas plot, 
shown in Figure IB, shows the measured anomalous scattering power in the data sets. 

20 As shown in Figure 1C, Aa is above 2.0 at the highest resolution. However, without the 

knowledge of Ac, which is significantly off from the theoretical 0.0 as shown in Figure 1 C, it is 
not possible to give a meaningful evaluation of the anomalous scattering signal. With the 
increasing redundancy, the anomalous scattering signal improves from the 60° data to the 1 80° 
data. It can be seen that beyond 2.5 A the plots become erratic and the 1 80 ° data does not show 

25 much improvement compared with the 120° data. Figure ID shows that Eas is significantly off 
from ideal 0.0 for all sets. Eas of the 1 80° data is worse than that of the 120°C data, which may 
introduce noise to the Bij voet difference that cancels out the improvement due to the increase of 
redundancy. In fact, the whole data was collected by a single consecutive 1 80° G> scan, which is 
not optimized for collecting Bijvoet pairs. 
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To calculate the electronic density map, the data for the substructure of sulfurs was solved 
by XM using the 60 ° data with 3 .OA resolution cutoff that was suggested from the analysis of the 
Ras and Pas plots obtained in the experiments. We used the phases generated by SAS phasing 
by ISAS2001 to calculate the interpretable electron density map around Leu 11B of Zn-free 
insulin, as shown in Figure 2. The map was calculated using the phases derived by ISAS2001 
with the 60° data set to 3.0A. The 3.0A resolution cutoff was suggested by Ras plot. 

All references cited in this application are incorporated in their entireties by reference 
herein to the extent that they are not inconsistent with the present disclosure in this application. 
It will be apparent to one of ordinary skill in the art that methods, devices, device elements, 
materials, procedures and techniques other than those specifically described herein can be applied 
to the practice of the invention as broadly disclosed herein without resort to undue 
experimentation. All art-known functional equivalents of methods, devices, device elements, 
materials, procedures and techniques specifically described herein are intended to be 
encompassed by this invention. 



26 



